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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re patent application of Docket No. : Y09-99-25 1 

David E. Johnson, et al. 

Serial No.: 09/349,494 Group Art Unit: No.: 2654 

Filed: July 9, 1 999 Examiner: EdouanJ, Patrick Nestor 

For: RULE INDUCTION FOR SUMMARIZING 
THE CLASSES IN A CLASSIFIED 
DOCUMENT COLLECTION 

Mail Stop: Non-final amendment 
Commissioner for Patents 
P.O. Box 1450 
Alexandria, VA 22313-1450 

DECLARATION UNDER 37 CF.R. S1.131 

Sir: 

We, David E. Johnson and Fredrick J. Damerau, do hereby declare: 

1 . We are co-inventors of the subject matter disclosed and recited in 
independent claims 1, 15, 16, and 21 (and the claims dependent therefrom) of the above- 
identified application. 

2. We completed the invention of claims 1, 15, 16, and 21(and those claims 
dependent therefix>m) in the United States before June 3, 1999, as evidenced below. 

CONCEPTION 

3. Before June 3 , 1 999 we conceived of an apparatus and method for 
providing summaries of documents belonging to a class of a classified document 
collection as disclosed in the present specification and recited in at least independent 
claims 1, 15, 16, and 21 of the application, of which is evidenced by IBM Disclosure 
YQR8- 1999-0410 ( hereinafter referred to as Athe Invention Disclosure®) attached hereto 
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as Exhibit A. The Invention Disclosure attached hereto is a photocopy of and is identical 
to the originals, except that all pertinent dates have been removed therefrom. 

4. All pertinent dates removed fiom the Invention Disclosure attached hereto 
are before June 3, 1999. 



5. As evidenced in the Invention Disclosure, the system and method for 
providing summaries of documents belonging to a class of a classified document 
collection includes at least a method and system for: 



a. inducing a set of rules from a sample set of documents, the set of rales 
being characteristic of the sample input documents, 

b. extracting the set of rules in order to provide a concise description of the 
one or more classes of the sample documents, 

c. comparing extracted words, phrases, and terms appearing in the set of 
rules induced from the sample set of documents to an individual incoming input 
document, and 

d. providing a summary of the individual incoming input document based on 
matches between the extracted wonls, phrases, terms and the individual incoming 
input document. 

6. As further evidenced in the Invention Disclosure, the apparatus and 
method further include a system and computer program code for: 

a. inducing a set of rules from a sample set of documents, the sample set of 
documents belonging to at least one class of a classified document collection and 
the set of rules being characteiisUc of the sample input documents. 
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b. extracting the set of rules in order to provide a concise description of one 
or more classes of the sample documents, 

c comparing the extracted set of rules to at least one individual incoming 
input document, and 

d. providing a summary of each of the at least one individual incoming input 
document based on matches between the set of extracted rules and the individual 
incoming input document. 

7. The benefits and features of the invention are shown and described in the 
Invention Disclosure. 



DUE DILIGENCE 

8. Prior to June 3, 1999, the inventors submitted the Invention Disclosure to 
IBM counsel for review and preparation of a patent application. 

9. Prior to June 3, 1 999. IBM counsel conveyed the Invention Disclosure to 
patent counsel at McGuireWoods, LLP for preparation of a patent application. 

1 0. Prior to the filing of the present application in the U.S. Patent Office, 
Inventor Damerau communicated with patent counsel at McGuireWoods LLP. on behalf 
of all of the inventors, in preparing a patent application based on the Invention 
Disclosure. For example, communications took place at least on June 23, 1999, June 26, 
!999,June28, 1999, JuneSO, 1999, and July 2, 1999, when a final application was 
completed encompassing the invention as described in the attached Invention Disclosure. 

1 1. During this time and prior to June 3, 1999, we worked diligently on 
providing information to IBM in-house counsel and counsel at McGuireWoods, and 
preparing the patent application for filing in the U.S. Patent Office. All of the inventors 
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were involved in reviewing and finalizing the application for the present invention prior 
to the filing of the above-identified application. 

12. We worked diligently on the preparation of the patent application with 
patent counsel at McGtaireWoods until a final draft patent application was completed to 
our satisfaction. A final draft was forwarded to us by McGuireWoods counsel Andrew 
M. Calderon on July 2. 1999. who filed the application, on our behalf, on My 9, 1 999. At 
all times, we worked diligently to finalize the application for filing in the U.S. Patent and 
Trademark Office fix)m prior to June 3, 1 999 to the finalized application on July 9, 1 999. 

13. We declare that all statements made herein of our own knowledge are true 
and that all statements made on information and belief are believed to be true; and 
fiwthcr, that the statements were made with the knowledge that willful felse statements 
and the like so made are punishable by fine or imprisonment, or both, under Section 1 00 1 
of Title 18 of the United States Code, and that such wiUfiil false statements may 
jeopardize the validity of the application or any patent issuing thereon. 

fl'iiK^ 

David E. Johnson \\ ; 



Date 




Fredrick J. Da6ierau " — 



Date 



\\COm\378740.I 
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Disclosure Y0R8-1 999-0410 

Created By: Fred Damerau Created On: 

Last Modified By: David E Johnson Last Modified On: 

*** IBM Confidential*** 



Summary 



Required fields are marlced with the asterisk (*) and must be filled in to comptete the fonn . 



Status 


Submitted 


Processina Location 


YOfl 


Functional Area 


600 Services. Applications & SoluUons-Kovac 


Attorney/Patent 
Professionai 


Stephen C Kaufman/Watson^BM 


Submitted Date 




Owning Division 


RES 


PVT Score 


27 



Inventors with Lotus Notes ID'S 

Inventors: Fred Damerau/Watson/IBM, David E Joiinson/Watson/iBM 



Manager 
Serial 




Inventors without Lotus Notes ID*s 
IDT Selection 



Main Idea 



Rule induction for summarising the classes in a classified document collection 

^^^^^^^^inti<^, statf^ the probiem'solved^^^^ arid indicating the advantages of 

using the invention. 

The invention is a system for providing precise summaries of the documents belonging to each class in a 
classified document collection. An example would be a collection of news stories, each of which is tagged 
as belonging to one or more categories. 

The system provides a short categorisation of each category based on the attributes of the documents 
contained in that category. Another example would be a collection of transcripts of martceting calls, each 
classified as "sale** or "no sale". The system would provide a summary of the attributes of calls which led 
to sales. 

2. How does the invention solve the problem or achieve an advantage,(a description of *1he invention**, 
including figures inline as appropriate)? 

A collection of documents is given to a machine learning system. The machine learning system induces 
mies which provide a characterisation of each class in the collection. These rules are decomposed to 
provide a concise description of the class. 
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3. If the same advantage or problem has been identified by others (inside/outside IBM), how have those 
others solved it and does your solution differ and why is it better? 

The problem of summarisation of individual documents has been addressed many times, usually by 
applying some method to extract sentences from the document which are felt to be particularly salient to 
the document topic. A common heuristic is to use the first sentence of each paragraph. These methods do 
not apply to document collections. An approach used for document collections involves linguistic analysis 
of a document collection to fill pre-defined templates or infonnation structures, and then using natural 
language generation techniques to provide a readable version of the formatted template, Dragomir Radev 
and Kathleen R. McKweon. "Generating Natural Language Summaries from Multiple On-Line Sources". 
Computational Unguistics. Vol. 99. No. 9. 1998. The bibliography of this paper gives references to 
previous work. An example of sentence selection is described in US5689716, 1 1/18/1997. "Automatic 
method of generating thematic summaries". A variant of this approach involving regions rather than 
sentences Is described in US5384703, "Method and apparatus for summarising documents according to 
theme". 

4, If the invention is implemented in a product or prototype. Include technical details, purpose, disclosure 
details to others and the date of that implementation. 

The machine learning component of the invention has been implemented in a prototype of an email 
classification and routing system (Kitcat) which is currently being made into a product by Corepoint. The 
idea of the rule set as a document characterisation tool has not yet been implemented but has been 
proposed as part of the solution for the VECSRI FOAK currently undenway with GBIS. Research and 
Dreyfuss. 

^Critical Questions ( Questions 1 - 7 must be answered) 
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*QuestiQn'2 ' .V 
1$ toere apji^^^ or act^a^public€ttiQn;ip^^ 


O Yes 

• no 

• ' --J-" 




iWBfyqq^w^ji^lf^^^ thjsM'enti&ri^,, ' i^; : ■ 


•YesVj: 

Q no: ; 


If yes.;%TteiAhe f^irne^;Bg^ ■ ' v ^ 

US5689716 Autonfiallc Method :of Generating Tlib^m^ 

Date l^ued:1l/f 8^997 
US5394703 Methoaa!5ind Apparatus for Summarising OoM 

Dat^l8§^:(M/#i9?5 ^ - , . 

eeneratiriig Mai^ 



Has the subject rnatter of thf jnyention or a pfoil^d incorporating^^^ Invention been spld, 
used irtehislte In mainufact^ emoiiifm^ a pnoposaR* 


O Yes 
• no 


Is a sale, use in nianufacturing, product announoemei^^dr proposal planned? 


• Yes 

Ono 
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or planned dam Pf $3!erannpunoemer 


Its, or 
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™Ml''!iS^^:'M^^^^^ Incxjrporatte.your inventiort used in 

paKi6<|^ite^«ah^iiy^i^^^ ...... r- 
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Have yA^i^pliscussed your inveritibg iiiStk ot^e^ at IBM? 


O Yes 
•-No 


SfeMSS^Hf ^fi^^^^'§^^-P in the text area with the fpllowing information. 
r!am<^i^g^indiwlu&ls, tf^^^^ #«ssed, under^pA, ahd COA #. ^ 
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Patent Value Tool (Optional - this may be used by the Inventor and attorney to assist with the evalu 

(The Patent Value tool can be used by you or the evaluation team to determine the potential licensina 
value of your invention.) 

These are the answers which were entered Into the Patent Value Tool. 
Market 

What is the anticipated annual market size (in dollars) that will be captured by your invention? 
$10Mto$100M 

Question 1 - How new is the technical field? 
Emerging 

Question 2 - How central is the invention to the product(s) which might be expected to contain the 

invention? 

Main 

Question 3 - What is the scope of the claim? 
Fundamental 

PQRTf=QUQMegD ■■^■■H 

What are the portfolio needs In the area of your Invention? 
Usted In PPM Needs 

EXPLOITATIOM & ENFQRCgMFNT 

Question 1 - How easily can the use of the invention by a competitor be detected? 
With work 

Question 2 - How easily can the use of the invention be avoWed by a competitor? 
With much work 

BUSINESS VAUIF 

Question 1 - What percentage of the companies producing products in the field of this invention miaht use 
this invention? 
Broadly cloned 

Question 2 - What is the value of this patent to cun^ent or anticipated Alliance Activity between IBM and 
other companies? 
Some value 

Question 3 - What is the value of this patent to cun-ent or anticipated Technology Transfer Activity 
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between IBM and other companies? 
Some value 

Question 4 - Does it result in prestige to IBM? 
industry wide 

Post Disclosure Text & Drawings 

(Fomi Revised 12^17/97) 
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